Integrating Private Databases for Data Analysis

Authors

  • Ke Wang
  • Benjamin C. M. Fung
  • Guozhu Dong
Abstract

In today’s globally networked society, there is a dual demand for both information sharing and information protection. A typical scenario is that two parties wish to integrate their private databases to achieve a common goal beneficial to both, provided that their privacy requirements are satisfied. In this paper, we consider the goal of building a classifier over the integrated data while satisfying the k-anonymity privacy requirement. The k-anonymity requirement states that domain values are generalized so that each value of some specified attributes identifies at least k records. The generalization process must not leak any information more specific than the final integrated data. We present a practical and efficient solution to this problem.
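The k-anonymity requirement described in the abstract can be illustrated with a small sketch. This is not the paper's two-party integration method; it only checks the requirement on a single toy table and shows how generalizing a value can satisfy it. The function names (`is_k_anonymous`, `generalize`), the attribute names, the records, and the taxonomy are all hypothetical and chosen for illustration.

```python
from collections import Counter


def is_k_anonymous(records, attributes, k):
    """Return True if every combination of values on `attributes`
    is shared by at least k records."""
    groups = Counter(tuple(r[a] for a in attributes) for r in records)
    return all(count >= k for count in groups.values())


def generalize(records, attribute, mapping):
    """Replace each value of `attribute` with its more general value
    from `mapping`; values without an entry are left unchanged."""
    return [
        {**r, attribute: mapping.get(r[attribute], r[attribute])}
        for r in records
    ]


# Hypothetical toy data: "job" and "sex" are the specified attributes,
# "class" is the label a classifier would be trained on.
records = [
    {"job": "Lawyer", "sex": "M", "class": "Y"},
    {"job": "Doctor", "sex": "M", "class": "Y"},
    {"job": "Painter", "sex": "F", "class": "N"},
    {"job": "Janitor", "sex": "F", "class": "N"},
]

# Hypothetical one-level taxonomy for the "job" attribute.
taxonomy = {
    "Lawyer": "Professional",
    "Doctor": "Professional",
    "Painter": "Blue-collar",
    "Janitor": "Blue-collar",
}

generalized = generalize(records, "job", taxonomy)
```

In this toy table each raw (job, sex) pair identifies a single record, so the table is not 2-anonymous; after generalizing `job` one level up the assumed taxonomy, each pair identifies two records and the check passes for k = 2.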

Similar resources

Collusion Set Detection Through Outlier Discovery

Digging in the details: a case study in network data mining; Efficient identification of overlapping communities; Event-driven document selection for terrorism information extraction; Link analysis tools for intelligence and counterterrorism; Mining candidate viruses as potential bio-terrorism weapons from biomedical literature; Private mining of association rules ...

Semantic-JSON: a lightweight web service interface for Semantic Web contents integrating multiple life science databases

Global cloud frameworks for bioinformatics research databases become huge and heterogeneous; solutions face various diametric challenges comprising cross-integration, retrieval, security and openness. To address this, as of March 2011 organizations including RIKEN published 192 mammalian, plant and protein life sciences databases having 8.2 million data records, integrated as Linked Open or Pri...

An Intelligence-Based Model for Supplier Selection Integrating Data Envelopment Analysis and Support Vector Machine

The importance of supplier selection is nowadays highlighted more than ever as companies have realized that efficient supplier selection can significantly improve the performance of their supply chain. In this paper, an integrated model that applies Data Envelopment Analysis (DEA) and Support Vector Machine (SVM) is developed to select efficient suppliers based on their predicted efficiency sco...

Marketing Strategy Evaluation by Integrating Dynamic Systems Modeling and Network Data Envelopment Analysis

Service industries play an essential role in the economic development of countries, and among the various fields of insurance, life insurance is of particular importance because its coverage applies directly to people. Increased competition in the insurance industry has led managers to seek marketing strategies that, in addition to increasing insurance sales, reduce costs and gain ...

SME-Internet Tracking Study Report

EDI – Electronic Data Interchange (EDI) is the electronic exchange of routine business transactions.
Extranet – an extranet is a private network that uses the Internet protocol and the public telecommunication system to share securely part of a business's information or operations with suppliers, vendors, partners, customers, or other businesses.
ICT – information and communication technology.
Int...

Publication date: 2005